Word Segmentation on Discovered Phone Units With Dynamic Programming and Self-Supervised Scoring
نویسندگان
چکیده
Recent work on unsupervised speech segmentation has used self-supervised models with phone and word modules that are trained jointly. This paper instead revisits an older approach to segmentation: bottom-up phone-like unit discovery is performed first, symbolic then top of the discovered units (without influencing lower level). To do this, I propose a new model, chain two segment speech. Both use dynamic programming minimize costs from network additional duration penalty encourages longer units. Concretely, for acoustic discovery, duration-penalized (DPDP) contrastive predictive coding model as scoring network. For segmentation, DPDP applied autoencoding recurrent neural The chained in order gives comparable results state-of-the-art joint English benchmark. On French, Mandarin, German Wolof data, it outperforms previous systems ZeroSpeech benchmarks. Analysis shows system segments shorter filler words well, but might require some external top-down signal.
منابع مشابه
Word and phone level acoustic confidence scoring
This paper presents a word level confidence scoring technique based on a combination of multiple features extracted from the output of a phonetic classifier. The goal of this research was to develop a robust confidence measure based strictly on acoustic information. This research focused on methods for augmenting standard log likelihood ratio techniques with additional information to improve th...
متن کاملWaterloo at NTCIR-3: Using Self-supervised Word Segmentation
In this paper, we describe the system we use in the NTCIR-3 CLIR (cross language IR) task. We participate the SLIR (single language IR) track. In our system, we use a self-supervised word-segmentation technique for Chinese information retrieval, which combines the advantages of traditional dictionary based approaches with character based approaches, while overcoming many of their shortcomings. ...
متن کاملNonparametric Bayesian Semi-supervised Word Segmentation
This paper presents a novel hybrid generative/discriminative model of word segmentation based on nonparametric Bayesian methods. Unlike ordinary discriminative word segmentation which relies only on labeled data, our semi-supervised model also leverages a huge amounts of unlabeled text to automatically learn new “words”, and further constrains them by using a labeled data to segment non-standar...
متن کاملSemi-supervised Chinese Word Segmentation for CLP2012
Chinese word segmentation (CWS) lays the essential foundation for Mandarin Chinese analysis. However, its performance is always limited by the identification of unknown words, especially for short text such as Microblog. While local context are helpless in handling unknown words, global context do manifest enough contextual information, and could be used to guide CWS process. Based on this moti...
متن کاملSemi-supervised Chinese Word Segmentation based on Bilingual Information
This paper presents a bilingual semisupervised Chinese word segmentation (CWS) method that leverages the natural segmenting information of English sentences. The proposed method involves learning three levels of features, namely, character-level, phrase-level and sentence-level, provided by multiple submodels. We use a sub-model of conditional random fields (CRF) to learn monolingual grammars, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE/ACM transactions on audio, speech, and language processing
سال: 2023
ISSN: ['2329-9304', '2329-9290']
DOI: https://doi.org/10.1109/taslp.2022.3229264